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Abstract 


In this paper we describe a general procedure for calculating equivalent sensor radiances 
from variables output from a global atmospheric forecast model. In order to take proper 
account of the discrepancies between model resolution and sensor footprint the algorithm 
takes explicit account of the model subgrid variability, in particular its description of the 
probably density function of total water (vapor and cloud condensate.) The equivalent sensor 
radiances are then substituted into an operational remote sensing algorithm processing chain 
to produce a variety of remote sensing products that would normally be produced from actual 
sensor output. This output can then be used for a wide variety of purposes such as model 
parameter verification, remote sensing algorithm validation, testing of new retrieval methods 
and future sensor studies. We show a specific implementation using the GEOS-5 model, the 
MODIS instrument and the MODIS Adaptive Processing System (MOD APS) Data Collection 
5.1 operational remote sensing cloud algorithm processing chain (including the cloud mask, 
cloud top properties and cloud optical and microphysical properties products.) We focus on 
clouds and cloud/aerosol interactions, because they are very important to model development 
and improvement. 
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1 Introduction 


Accurate knowledge of cloud cover and cloud properties is important in model studies 
that involve Earth’s radiative budget, climate prediction and numerical weather prediction. 
High clouds are observed to have a net warming effect on the atmosphere because of their low 
albedo and low temperature. Low clouds have a net cooling effect due to their high albedo 
and relatively small temperature contrast with the surface. Clouds and their interactions with 
aerosols are significant sources of uncertainty in climate prediction studies (IPCC, 2007). In 
addition, clouds continue to be the main source of climate feedback uncertainty and hence 
climate sensitivity (e.g., Bony et al., 2006). 

The Goddard Earth Observing System Version 5 (GEOS-5) earth system model is 
maintained by the Global Modeling and Assimilation Office (GMAO) at NASA Goddard 
Space Flight Center (GSFC). GEOS-5 contains components for atmospheric circulation and 
composition (including atmospheric data assimilation), ocean circulation and 
biogeochemistry, and land surface processes. Components and individual parameterizations 
within components are coupled under the Earth System Modeling Framework (ESMF, Hill et 
al. 2004). In addition to traditional meteorological parameters (winds, temperatures, etc., 
Rienecker et al. 2008), GEOS-5 includes modules representing the atmospheric composition, 
most notably aerosols (Colarco et al. 2010) and tropospheric/stratospheric chemical 
constituents (Pawson et al. 2008), and the impact of these constituents on the radiative 
processes of the atmosphere. GEOS-5 has a mature atmospheric data assimilation system that 
builds upon the Grid-point Statistical Interpolation (GSI) algorithm jointly developed with 
NCEP (Wu et al. 2002, Derber et al. 2003, Rienecker et al. 2008). The GSI solver was 
originally developed at NCEP as a unified 3D-Var analysis system for supporting global and 
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regional models. GSI includes all the in-situ and remotely sensed data used for operational 
weather prediction at NCEP. GEOS-5 also includes assimilation of Aerosol Optical Depth 
(AOD) observations from the MODerate resolution Imaging Spectroradiometer (MODIS) 
imager on the NASA Earth Observing System (EOS) Terra and Aqua spacecraft; an 
algorithm for assimilating cloud property infomiation from measurements in the visible and 
infrared portions of the spectrum is currently under development (Norris and da Silva 2013). 
While the GEOS-5 meteorological assimilation includes a wide variety of spaceborne sensor 
data, traditionally samples containing clouds are carefully screened out. The near real-time 
GEOS-5 data assimilation and forecasting system runs at a nominal horizontal resolution of 
25 km with 72 vertical layers (Rienecker et al. 2008, Molod et al. 2012). 

The MODIS instrument (Barnes et al. 1998) is a passive imager, producing a wide variety 
of remotely sensed data products for land, ocean and atmosphere disciplines from 36 spectral 
channels. Data Collection 5.1 processing includes algorithms for retrieving cloud cover 
amount (Ackerman, et al. 2006, Frey et al. 2008), cloud top properties such as cloud top 
pressure and temperature (Menzel et al. 2008) and cloud optical and microphysical properties 
such as cloud optical thickness, cloud effective radius and cloud water path (Platnick et al. 
2003; Wind et al. 2010; Zhang and Platnick 2011; King et al. 2013). 

In this paper we present a technique that brings together remote sensing methods and 
model-generated fields. We use MODIS geolocation data to sample GEOS-5 fields as if the 
MODIS instrument were flying over the model fields instead of the Earth surface. Once the 
sampling is complete, we generate equivalent sensor radiance data for the MODIS footprint. 
We then replace the contents of the 1 km, 500 m and 250 m resolution MODIS Level- IB 
(Xiong, et al. 2006) radiance files with these simulated radiances and insert the resulting 
alternate data stream into the start of the MODIS Adaptive Processing System (MODAPS) 
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operational algorithm processing chain for the atmosphere discipline cloud products 
mentioned above (product designation MOD06 and MYD06 for Terra and Aqua MODIS, 
respectively). The data stream is fully transparent to the system so that pixel-level (Level-2) 
retrievals can be aggregated through the same gridded (Level-3) l°xl° code (Hubanks et al. 
2006; King et al. 2003, 2013) used in MODAPS production (MOD08 and MYD08 for Terra 
and Aqua, respectively). There are many potential uses for the resulting Level-2 and Level-3 
data. Level-3 model aggregations can be compared to archived MODIS Level-3 and GEOS-5 
source data fields directly as a means of model validation and study of model biases that 
could exist. Level-2 data can be used to study some aspects of retrieval algorithm behavior 
and sensitivities since all retrievals are performed with known (prescribed) truth. 

The equivalent sensor data framework has been developed with instrument flexibility in 
mind, so that by simple substitution of spectral response functions and data reader, the 
MODIS instrument can be replaced by other spacebome or airborne sensors, currently in 
operation or part of a future concept, and a different sensor data stream can be produced. 
Thus, identical products from different sensors or different retrieval algorithms for the same 
sensor can be compared and analyzed in a controlled environment, which can provide insight 
and lead to improvements in remote sensing algorithms. 

This flexibility extends to model data as well. Any climate or weather prediction model 
fields can be used as long as a means of ingesting the necessary parameters is provided. Thus 
synthetic retrievals based on multiple models can be compared and analyzed using the same 
sensor interface in a controlled environment, leading to a consistent diagnostic toolset. 
Furthermore, this detailed simulation capability can function as a testbed for very fast 
simulators such as the Cloud Feedback Model Intercomparison Project (CFMIP) Observation 
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Simulator Package (COSP) (Bodas-Salcedo, et al. 2011) or the hyperspectral simulator of 
Feldman et al. (2011). 

In section 2 we describe the model-sensor interface using the GEOS-5 model and 
MODIS imager. Section 3 shows an example of simulation and retrieval of cloud properties 
on a sample Level-2 data granule. In section 4 we elaborate on future directions for the 
software suite. 

This paper is the first part in a series that will combine the software suite described in 
detail here with a variety of research applications. 

2 Radiance simulations at scales smaller than the model’s grid spacing 

We start the process by selecting an area and time period of study. It can be as small as a 
few-pixel subsection of a single MODIS granule or as large as an entire year of MODIS data. 
The study size is only limited by availability of computing resources. As far as model itself is 
concerned, there is no need for actual MODIS data to be present, but we specifically want the 
actual MODIS radiances to be available, so that retrievals from simulated and actual radiances 
can be compared directly. Similarities and differences in those retrievals can be analyzed and 
results applied on a variety of levels in order to improve both the model and the sensor 
retrieval algorithm. 

For simplicity’s sake in all subsequent references and illustrations the study area will be 
taken to be a standard 5 minute MODIS data granule (approximate 2000 km in the along track 
direction by 2300 km). Once the granule is chosen, we proceed to choose model output files 
that bound the granule time. For example, for the granule at 0200 UTC, we would select 
model output at 0000 and 0300 UTC. We use a MODIS standard geolocation file (MOD03 
product) to define the spatial locations to sample the model fields. Solar and view angle 
information contained in the same MODIS geolocation file is also used in the simulation . For 


6 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 


the examples shown in this paper we used the GEOS model v.5.7.2 output. A listing of 
specific GEOS-5 fields and products used in this suite is given in Table 1. 

2.1 Surface albedo determination 

In order to save on computational time we pre-determine surface albedo for the area of 
study. The surface albedo data comes from a variety of sources. Over ice-free ocean, MODIS 
geometry and model wind speed are used in a Cox-Munk ocean surface BRDF model (Cox 
and Munk, 1954) to produce cloud-free ocean surface reflectance. This model reflectance is 
calculated for four cardinal wind directions and then averaged into a lookup table (LUT), that 
is a function of wavelength, wind speed, solar and sensor zenith angles and relative azimuth 
angle. We do the calculation at three wind speeds of 3, 7 and 15 m/s. The LUT has 33 solar 
zenith values, 28 view zenith values and 37 relative azimuth values. We linearly interpolate 
as needed to obtain surface spectral albedo in the selected fifteen MODIS channels that have a 
shortwave reflective component. The MODIS channels used in the simulation and their 
central wavelengths are listed in Table 2. The ocean reflectance LUT contains data for 
channels 1-22 and 26 from the table. For the IR (infrared) channels that have no reflective 
component (27-36), a constant value of 0.015 is used. This value is based on the ocean 
surface emissivity value suggested by the MODIS cloud top properties algorithm. 

Over land several methods are utilized to model the radiances for all MODIS channels. 
We use the MODIS land surface spectral albedo gap-filled dataset (Moody et al. 2005, 2008) 
that has been updated for MODIS data Collection 6 and is derived from both Aqua and Terra 
data. In addition to providing 16-day time period averages every 8 days, the gap-fdled albedo 
files are generated for each year separately (instead of aggregating all years together as was 
done previously). Further, spatial resolution has been improved to 1 km. These files are 
derived from the Collection 5 MCD42B product (C. Schaaf et al., personal communication). 
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This MODIS land surface albedo product is used directly for channels 1-7 and interpolated 
linearly to cover other MODIS spectral channels that fall within the 0.47 - 2.14pm 
wavelength range. For wavelengths that are longer than 2. 14pm we use the surface emissivity 
dataset used in MODIS clear-sky profde retrievals (Seemann et al. 2008). For wavelengths 
shorter than 0.47pm we use a seasonally averaged surface albedo database utilized by the 
MODIS deep-blue algorithm (Hsu et al. 2004) to obtain the albedo for the 0.41pm channel 
and then interpolate for the 0.44 pm channel. 

Over snow and sea ice we use the MODIS zonal snow/ice albedo dataset (Moody et al. 
2007). The table lookup is determined by the MODIS pixel latitude, the International 
Geosphere-Biosphere Programme (IGBP) ecosystem type and snow/ice fractions from 
GEOS-5 FRSNO and FRSEAICE model fields. 

The resulting surface albedo values are written out to fde for each study area so that they 
can be referenced later for different simulation scenarios involving same area and time. A 
good example of such varying scenarios would be repeated experiments with or without the 
presence of aerosols. 

2.2 Water vapor and other gaseous absorbers 

After the surface albedo calculations are complete, we proceed to ingest the profdes of 
temperature, relative humidity, ozone concentration and atmospheric pressure. These profiles 
are downsampled to 27 atmospheric levels, as listed in Table 3, from the GEOS-5 native 72 
vertical levels and sent to an atmospheric transmittance module that uses the correlated-k 
method (Kratz, 1995) to calculate weights and optical thicknesses for each atmospheric layer 
due to water vapor and other gaseous absorbers. In cases where the surface is encountered at 
an altitude higher than 0 km, the profile is trimmed accordingly and the surface level is 
inserted as the last level to be used. We perform the vertical downsampling in order to save on 


8 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 


the computational cost of the equivalent sensor radiance simulation step, with the bulk of 
downsampling occuring above the tropopause. We preserve finer vertical resolution in the 
troposphere. We find this to be permissible as radiance data stored in a MODIS LIB file has 
an accuracy to only the 5 th decimal place and the uncertainty due to varying the number of 
vertical levels in the upper atmosphere is less than this data storage accuracy. 

2.3 Generating cloud subcolumns 

Sampling of model cloud-related fields to the MODIS pixel scale is not straightforward 
because cloud properties typically vary on scales not adequately resolved by the operational 
0.25° GEOS-5 resolution. To sample cloud fields, 1 km MODIS pixels for each GEOS-5 
gridcolumn are collected and the same number of pixel-like sub-columns is generated using a 
statistical model of sub-gridcolumn moisture variability. The general approach of Norris et al. 
(2008) is followed, namely using a parameterized probability density function (PDF) of total 
water content for each model layer and a Gaussian copula to correlate these PDFs in the 
vertical. 

In this application, we use the skewed triangle PDF, which allows a simple inclusion of 
moisture variability skewness, a ubiquitous feature of atmospheric boundary layers. This PDF 
has a simple scalene form characterized by three parameters: a lower and upper bound and a 
mode. Under some circumstances, these three parameters can be directly diagnosed from the 
layer mean total water and condensate contents, q, and q c , and cloud fraction/ but in many 
cases some adjustments are necessary to f and possibly q c , to achieve consistency. The details 
of this calculation are beyond the scope of this paper and are described fully in Norris and da 
Silva (2013). Approximations must also be made in the case of clear or overcast layers, when 
the triangle is under-determined. 
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For the Gaussian copula we use a correlation matrix with a fixed vertical decorrelation 
scale of 100 hPa, further modified by a multiplicative Riishojgaard (1998) flow-dependent 
correlation in total water that pemiits sharper decorrelation across inversion features. Further 
details are given in Norris and da Silva (2013). Once the correlation matrix is specified, the 
Gaussian copula correlated ranks of each of the gridcolumn’s layers are easily generated 
(Norris et al., 2008) and then inverted with the cumulative distribution function (CDF) of 
each layer’s skewed triangle distribution. The net result is an ensemble of subcolumns of total 
moisture content that sample the specified layer PDFs and have the specified vertical 
correlations and accompanying cloud and condensate overlap properties. The transformation 
of total moisture content to vapor, liquid water and ice contents assumes the vapor is capped 
at the GEOS-5 saturation vapor content and that the excess moisture is condensate, split 
between the phases using an ice fraction linear in temperature over the -35 to 0°C range. It is 
these subcolumn condensates, combined with GEOS-5 diagnostic effective radii, that are used 
to evaluate subcolumn (or “pixel”) liquid water and ice optical thicknesses for each layer. 
These are input to the MODIS radiance simulator code. 

Note that the subcolumns generated in this way are horizontally independent (the 
independent column approximation or ICA), but are subsequently “clumped,” or rearranged, 
to give horizontal spatial coherence, by using a horizontal Gaussian copula applied to 
condensed water path. This clumping acts to give the generated clouds a reasonable horizontal 
structure, such that the cloudy pixels in a gridcolumn are actually grouped into reasonable 
looking clouds, rather than being randomly distributed. This is important because the MODIS 
cloud optical and microphysical properties retrieval algorithm has some spatial variance tests 
for potentially partially-cloudy pixels, removing cloud edges by the so-called “clear-sky 
restoral” (Zhang and Platnick 2011; Pincus et al. 2012). If clumping is not used, then 
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individual points generated by ICA stand an exceptionally high chance of being eliminated by 
the clear sky restoral unless a model grid box has a nearly 100% cloud fraction. 

In practice, the clumping algorithm works as follows: a correlation matrix C is a 
generated between all pixels in a gridcolumn based on the distance between the actual pixels 
in the MODIS granule and assuming a nominal 5 km decorrelation length. If there are N 
pixels, C is an N X N matrix. This matrix is used by a Gaussian copula to generate N correlated 
ranks, which are subsequently used to sample (effectively, to re-order) a list of the N 
simulated pixels that has previously been sorted by column condensed water path (CWP). 
Because horizontally nearby pixels are more correlated by C, they will have a higher chance 
of having similar ranks, and therefore similar values of CWP. In this way the pixels are 
grouped together horizontally into coherent clouds. [Note that this clumping acts on 
subcolumns as a whole, and independent of the preexisting vertical correlations in the ICA 
subcolumns, so the clumping will work better for single cloud layers. For multilayer clouds, 
the layer that dominates the CWP will dominate the clumping.] 

The sub-gridcolumn cloud generator described above is, of course, only one of many 
possible generators. A less complicated example, very much akin to the internal GEOS-5 
treatment of cloud overlap, would be the following “homogeneous cloud, maximum-random 
overlap” generator: divide the atmosphere into pressure bands, e.g., low, middle and high 
bands, with interfaces at 700 and 400 hPa. Say we again wish to generate N subcolumns, n = 
1, for the gridcolumn. Then for each pressure band, generate a set of N uniform random 
numbers {r n } on [0,1], and for each model layer k falling within the band, assign cloudiness to 
layer k of subcolumn n if r n < /)-, where fk is that layer’s cloud fraction. The fact that the same 
set {r„} is used for each layer k in the band enforces maximum cloud overlap within the band. 
But choosing independent sets of {r n } for each pressure band enforces random overlap 
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between the bands. Finally, every snhcolnmn which is cloudy at layer k, shares the same 


homogeneous in-cloud condensate contents — {,J)k , where q( h i)k are the layer mean condensate 

fk 

contents (i=ice, l=liquid water). 

Note that this simple generator, as with the earlier more sophisticated generator, produces 
subcolumns of condensate. The specification of optical thicknesses from condensate contents 
proceeds in the same way in both generators, as presented earlier. We emphasize this strategy 
because the reader should be very aware of the potential traps associated with using 
diagnostic layer cloud optical thicknesses directly from GCM (e.g., GEOS-5) output files. 
When using diagnostic layer cloud optical thicknesses directly, one must know whether they 
are in-cloud or “layer mean”, and if in-cloud, for what cloud fraction. For example, for 
GEOS-5, the layer cloud optical thicknesses TAUCLW and TAUCLI are “in-cloud” values 
consistent with the maximum cloud fraction f max of the layer’s pressure band, not with the 
layer’s actual cloud fraction fk. This is because the GEOS-5 diagnostics internally dilute each 
layer’s in-cloud cloud optical thickness (in an approximately radiatively-consistent manner) 
by stretching each layer’s cloud fraction to its band’s f max , in order that it may simply add the 
diluted layer “in-cloud cloud optical thicknesses” within a band to produce a “band in-cloud 
cloud optical depth”, such as GEOS-5’s “TAULOW”. Because of this, the GEOS-5 
diagnostic TAULOW, for example, can be regarded as the low-band in-cloud cloud optical 
thickness consistent with the model’s low band cloud fraction CLDLOW. Similarly for 
TAUMID and TAUHGEF But note that one cannot simply add TAULOW, TAUMID and 
TAUHGE1 to get a column in-cloud optical thickness, because the actual column value 
depends on the overlap of these bands. Currently this overlap is random, and so we could 
express the column in-cloud optical thickness in terms of the sum over the 2 3 combinations of 
cloud/clear in the three bands, each with their respective fractions (e.g., a fraction 
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CLDLOW*CLDMID*CLDHGH of the gridcolumn would have a column TAU of 
T AULO W+T AUMID+T AUHGH, a fraction CLDLOW*CLDMID*( 1 -CLDHGH) would 
have a column TAU of TAULOW+TAUMID, etc.), and finally all normalized by the column 
cloud fraction CLDTOT = 1 - (l-CLDLOW)*(l-CLDMID)*(l-CLDHGH). 

Not surprisingly this gives 

CLDLOW * TAULOW +C-LDMID * TAUMID +C'LDHGH * T AUHGH 

TAUTOT = 

CLDTOT 

for the column in-cloud optical thickness, because we are assuming that we can average 
optical thickness (including zeros) in the horizontal, to get a layer mean optical thickness. 
That assumption in itself is rather questionable, since cloud radiative properties are non-linear 
in optical thickness. It is therefore much more accurate to deal with radiative averages over 
subcolumn ensemble optical thicknesses, generated as we have described, than with “layer 
mean” or “band in-cloud” optical thicknesses directly. 

The point we are ultimately making is that one cannot simply interpret the column 
consequences of model layer cloud diagnostics without a knowledge of the model’s cloud 
overlap. This is why GEOS-5 now includes the COSP simulator suite to produce satellite 
observables with an inbuilt treatment of the model overlap. 

2.4 Radiative transfer calculation 

Now that we’ve collected all the necessary information about atmosphere and cloud 
layers, we begin the simulation process. The radiative transfer calculations were performed 
using the Discrete Ordinate Radiative Transfer (DISORT) code (Stamnes et al. 1988) with 
liquid water cloud phase function results from Mie calculations based on gamma distribution 
water droplet size distributions with an effective variance of 0.1 and bulk ice cloud phase 
models developed by Baum et al. (2005), both consistent with MOD06. We have 
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experimented with a different number of computational streams in order to balance speed and 
desired accuracy. We found that only 16 streams were required to achieve the needed 
precision. Generally a large number of streams is required to accurately model the forward 
peak of the phase function and multiple scattering components. Initial calculations were done 
with 32 streams, however the execution time was rather prohibitive. We settled on 16 streams 
as a balance between execution time and precision as the difference in resulting equivalent 
sensor radiance between 32 and 16 stream simulations was less than 0.5%.. Also the forward 
peak is further truncated and use of the delta-fit method of Hu et al. (2000) can be considered 
sufficiently accurate, as described by Ding et al (2009), for calculations where there is no 
stored accuracy limit such as the multilayer cloud simulations in Wind et al (2010). As we 
pre-calculate the surface spectral albedos, we can save further time by calling DISORT in 
Lambertian mode with predetermined values. When we encounter cloud subcolumns over the 
ocean, however, we must adjust the computed Cox-Munk surface albedo to compensate for 
the diffuse illumination that the presence of the cloud creates. A good value for the diffuse 
illumination albedo of a water surface is 0.05 (Platnick et al. 2003). We then linearly fit 
surface albedo as a function of cloud optical thickness, with full diffuse illumination at a total 
column cloud optical thickness of 3 and full Cox-Munk surface albedo at total column cloud 
optical thickness of 0. 

3 Example retrievals 

In this section we discuss two example results of radiance simulations and subsequent 
cloud property retrievals. We performed the simulation on the NASA Center for Climate 
Simulations (NCCS) Discover system using 12 Intel Westmere nodes with 12 cores each. The 
memory footprint of the software suite is very small, around 80 Mb peak usage, but the 
process is quite CPU-heavy. A full-resolution 1 km simulation using a full MODIS granule as 
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a study area took about three and a half hours wall clock time to complete. Figure 1 shows 
resulting true-color RGB image of sample MODIS granule 2012 day 228 (August 15 th ) at 
12:00 UTC together with the true-color image of the actual MODIS granule before its channel 
data was replaced. Panel a) shows the actual data acquired by Aqua MODIS and panel b) 
shows the simulation result. GEOS-5 does not assimilate cloudy radiances and so there should 
be little expectation of a granule-level feature match. However, in this case the model does 
remarkably well with cloud placement. Bands of cloud over southern France are present and 
located properly, as are clouds over the northern Balkans and southern Asia Minor. The 
orographic clouds over Italy and Greece are also present, as are scattered clouds over the 
Sahara desert. There are some rather important differences between the model and the actual 
data, however, when it comes to cloud properties. 

Figure 2 shows the results of running the Data Collection 5.1 operational retrieval chain 
on the resulting LIB file from model fields. Panel a) shows the cloud thermodynamic phase, 
panel b) the cloud top pressure, panel c) the cloud optical thickness and panel d) the cloud 
effective radius retrieved with the VNSWIR (Visible, Near- or ShortWave InfraRed) and 
2.1pm channel combination. Figure 3 shows the actual Aqua MODIS retrieval for that same 
granule using identical panel arrangement. 

The cloud field over the central Mediterranean is given improper vertical location by the 
model. The actual cloud field is retrieved as liquid water and is low cloud, with cloud top 
pressures of 800-900mb. The model generates a thin cirrus cloud in that location with cloud 
top pressure of about lOOmb and of course ice thermodynamic phase. This has serious 
implications for outgoing radiation The cloud field over Romania has a consistent phase, but 
the model indicates the cloud to be positioned somewhat higher in altitude than the 
observation and also significantly optically thicker than what is observed. The same is true for 
the cloud field over NW Turkey. 


15 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 


Figure 4 shows a cloud top pressure / cloud optical thickness joint histogram for the 
actual granule in panel a) and simulated one in panel b). While in this comparison we are not 
necessarily looking for quantitative evaluation of model parameters, some things do tend to 
jump out. The actual MODIS granule has mostly low clouds that are moderately thick. The 
simulated granule on the other hand lacks low clouds almost entirely and instead produces 
thicker clouds at high altitude. The RGB images look very similar in this case, so the model is 
performing well on geographical cloud placement, but fails rather badly when it comes to 
proper cloud placement in altitude. This kind of disconnect can have some significant 
implications for Earth radiative budget calculations. It is one of our future goals to determine 
just how frequently such disconnects occur on the global scale. 

Figures 5 through 8 show another simulation example, this time from Terra MODIS 
2013 day 151 (05.31.2013) at 11:15UTC. Figure 5 shows a true-color RGB image for 
simulated and actual MODIS granule together with the SWIR composite that allows user to 
visually estimate cloud thermodynamic phase. Ice clouds appear red in such image. We 
cannot show a SWIR composite for Aqua MODIS because of detector issues with the Aqua 
MODIS 1.6pm channel that is needed to create the image. Figure 6 shows retrieval results for 
the simulated granule, figure 7 shows the actual Terra MODIS granule retrievals. Figure 8 
shows the joint histograms of cloud top pressure and cloud optical thickness. In this case there 
is actually reasonable agreement between sensor measurement and model cloud field 
representation both geographically and vertically. The model could have benefitted from 
producing somewhat more mid-level clouds, but overall the large convective system that 
dominates the scene is represented reasonably well, as are the broken clouds around it. In this 
case the MODIS operational cloud mask had some trouble detecting clouds in the sun glint 
region, but sun glint quite often can be a challenging area for retrieval algorithms. 

The MODIS cloud top pressure retrieval is quite sensitive to ancillary atmospheric 
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profile information (Menzel et al 2008) and some of differences found in retrievals could be a 
result of different representations of the atmospheric profile by GEOS-5 and the NCEP 
Mesoscale Meteorological Model (MM5)-derived model profiles used during the MODIS 
retrieval. 

Situations in which cloud optical thickness retrievals show significant differences tend to 
be more indicative of significant differences in cloud structure. Unlike cloud top pressure, 
cloud optical thickness retrievals have very little dependence on atmospheric profile 
information as there is very little atmospheric absorption in the 0.65 and 0.86pm channels 
used to retrieve this quantity 

Cloud effective radius retrievals from the 2.1pm channel depend somewhat on the 
atmospheric profile, but differences in that retrieval are also mainly due to differences in 
cloud microphysics present in the model and in the actual atmosphere. Retrieved cloud 
effective radius appears to be somewhat smaller overall for GEOS-5 data than for MODIS. 
Even though the clumped-ICA cloud formation method allows us to model some of the scene 
inhomogeneity normally encountered in actual MODIS data, the present implementation of 
the simulator does not admit sub-pixel-based effective radius artifacts such as ones appearing 
in MODIS (Zhang and Platnick, 2011). Also, GEOS-5 uses a simple diagnostic prescription 
of cloud effective radius that is loosely based on large scale observations of aerosol 
concentrations and their physical connection to cloud droplet size, and with the details being 
adapted based on consistency with surface radiation budget estimates of shortwave cloud 
forcing. It is therefore not surprising that, in our preliminary results, there appears to be 
generally somewhat less variability in retrieved cloud effective radius from GEOS-5 than 
from real MODIS data. 

4 Conclusions and future directions 

We have developed a flexible software suite that allows us to interface model fields to 
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operational satellite remote sensing retrieval algorithms. We have presented an example of its 
operation using the GEOS-5 model and MODIS instrument. We have demonstrated the power 
of this software in locating and quantifying problems with GEOS-5 cloud optical properties 
and cloud vertical distributions within the specific geographic and synoptic contexts observed 
in MODIS cloud granules. 

In subsequent papers we will show a number of applications of this software. Our current 
plans include performing a number of large-scale simulation experiments using GEOS-5 
nature run model data with resolutions as high as 7 km. We would like to examine aerosol- 
cloud interactions by performing simulations with and without model aerosol fields. Once 
operational cloud and aerosol retrieval algorithms are applied to such data, we may be able to 
quantify some aerosol effects on clouds and maybe even find some ways to retrieve aerosols 
above clouds. 

We would also like to examine in detail the performance of COSP, the Cloud Feedback 
Model Intercomparison Project (CFMIP) Observation Simulator Package, which provides a 
means of simulating retrieved cloud properties from a variety of instruments, including 
MODIS, from model fields. The COSP package typically uses approximate, so-called “fast” 
simulators of cloud properties, whereas we are directly simulating radiances and performing 
the retrievals using actual operational retrieval codes. We thus have the ability to test the 
quality of the COSP fast simulators 

We also intend to apply this software suite to other remote sensing instruments, such as 
SEVIRI (Spimiing Enhanced Visible Infrared Radiometer Imager) onboard the Meteosat 
Second Generation geostationary satellite series. SEVIRI has a sufficient number of channels 
that MODIS-style cloud retrieval algorithms can be applied. SEVIRI acquires data in 15 
minute intervals at 3 km nadir resolution, thus giving fine temporal and spatial resolution for 
diurnal sampling. Applying the equivalent sensor radiance package to SEVIRI would allow us 
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potentially to examine model cloud dynamics at a temporal resolution not offered by polar 
orbiting satellites. 

We are confident that there are many other applications for this software that will be 
found in the future besides ones outlined above and that it will become a valuable tool for 
both the remote sensing and modeling communities. 

The simulator code is available to users free of charge by contacting the authors and 
becoming a registered user of this software package so that any updates can be issued directly. 
There may be additional, wider distribution means in the future if the software shows signs of 
growing popularity. 
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1 Table 1 . GEOS v.5.7.2 fields and products used in simulations 


Field Code 

Description 

U10M 

U-component of wind speed at 10m altitude 

V10M 

V-component of wind speed at 10m altitude 

FRSEAICE 

Sea ice fraction 

FRSNO 

Snow fraction 

PS 

Surface pressure 

T2M 

Temperature at 2m altitude 

SLP 

Mean sea-level pressure 

QV2M 

Specific humidity at 2m altitude 

03 

Ozone concentration profde 

T 

Temperature profile 

DELP 

Level pressure differential profile 

RH 

Relative humidity profile 

CLOUD 

Radiative cloud fraction profde 

QLLS 

Large scale cloud liquid water mixing ratio 

QLAN 

Anvil cloud liquid water mixing ratio 

QILS 

Large scale cloud ice mixing ratio 

QIAN 

Anvil cloud ice mixing ratio 


2 

3 
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1 Table 2. MODIS channels used in simulations 


Channel number 

Central wavelength (tint) 

1 

0.65 

2 

0.86 

3 

0.47 

4 

0.55 

5 

1.24 

6 

1.63 

7 

2.13 

8 

0.41 

9 

0.44 

17 

0.91 

18 

0.94 

19 

0.94 

20 

3.7 

22 

3.9 

26 

1.38 

27 

6.2 

28 

7.3 

29 

8.5 

31 

11.0 

32 

12.0 

33 

13.2 

34 

13.4 

35 

13.8 

36 

14.2 


2 

3 
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1 Table 3. Vertical levels used in simulation 


Level number 

Level altitude (km) 

1 

80 

2 

60 

3 

50 

4 

45 

5 

40 

6 

35 

7 

30 

8 

25 

9 

20 

10 

18 

11 

16 

12 

15 

13 

14 

14 

13 

15 

12 

16 

11 

17 

10 

18 

9 

19 

8 

20 

7 

21 

6 

22 

5 

23 

4 

24 

3 

25 

2 

26 

1 

27 
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a) Actual RGB composite 


b) Simulated RGB composite 



1 

2 Figure 1. Equivalent sensor radiance simulation together with an actual MODIS granule that 

3 was used as study area. Aqua MODIS granule 2012 day 228 at 12:00 UTC. GEOS-5 temporal 

4 fit between 12Z and 18Z 2012.08.15. RGB (0.67pm, 0.55pm, 0.47pm) 
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a) Cloud Thermodynamic Phase 



■ Ocean ■ Liquid Water Cloud 

■ Land Ice Cloud 


| Undetermined 
c) Cloud Optical Thickness 



b) Cloud Top Pressure 
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Figure 2. Example cloud retrieval for simulated granule covered by Aqua MODIS 2012 day 


288 12:00 UTC. Panel a) is cloud thermodynamic phase, panel b) is cloud top pressure, panel 


c) is cloud optical thickness and panel d) is cloud effective radius from the 0.86-2. lpm band 


combination 
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Figure 3. Actual cloud retrieval for Aqua MODIS 2012 day 228 at 12:00 UTC . Panel a) is 
cloud thermodynamic phase, panel b) is cloud top pressure, panel c) is cloud optical thickness 
and panel d) is cloud effective radius from the 0.86-2. lpm band combination. 
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a) Aqua MODIS CTP/COT Joint Histogram 


b) Simulated CTP/COT Joint Histogram 



Figure 4. Joint histograms of cloud optical thickness vs cloud top pressure for actual (a) and 
model-based (b) cloud fields covered by Aqua MODIS 2012 day 228 at 12:00 UTC. 
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a) Actual RGB composite 


b) Simulated RGB composite 




c) Actual SWIR composite 


d) Simulated SWIR composite 




2 

3 Figure 5. Equivalent sensor radiance simulation together with an actual MODIS granule that 

4 was used as study area. Terra MODIS granule 2013 day 151 at 11:15 UTC. GEOS-5 temporal 

5 fit between 06Z and 12Z 2013.05.31. RGB (0.67pm, 0.55pm, 0.47pm), SWIR(0.86pm, 

6 1.6pm, 2.1pm) 
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Figure 6. Example cloud retrieval for simulated granule covered by Terra MODIS 2013 day 
151 11:15 UTC. Panel a) is cloud thermodynamic phase, panel b) is cloud top pressure, panel 
c) is cloud optical thickness and panel d) is cloud effective radius from the 0.86-2. lpm band 
combination 
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Figure 7. Actual cloud retrieval for Terra MODIS 2013 day 151 at 11:15 UTC . Panel a) is 
cloud thermodynamic phase, panel b) is cloud top pressure, panel c) is cloud optical thickness 
and panel d) is cloud effective radius from the 0.86-2. lpm band combination. 
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a) Terra MODIS CTP/COT Joint Histogram 


b) Simulated CTP/COT Joint Histogram 



Figure 8. Joint histograms of cloud optical thickness vs cloud top pressure for actual (a) and 
model-based (b) cloud fields covered by Terra MODIS 2013 day 151 at 11:15 UTC. 
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